Basic Statistics

Raw Counts

Name                    Value
Rows                    336,776
Columns                 37
Discrete columns        20
Continuous columns      17
All missing columns     0
Missing observations    473,357
Complete rows           267,789
Total observations      12,460,712
Memory allocation       78.9 Mb
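
The counts above are enough to recompute the shares that a percentage view of the same table would show. The following is a minimal sketch in base R, assuming that "Total observations" means rows times columns (which matches the figures above) and that each share is taken against the natural denominator. The variable names are illustrative, not taken from the report.

```r
# Raw counts copied from the table above.
rows          <- 336776
columns       <- 37
discrete      <- 20
continuous    <- 17
missing_obs   <- 473357
complete_rows <- 267789

# Assumption: total observations = number of cells = rows x columns.
total_obs <- rows * columns

# Each share uses the denominator that fits its unit (columns, rows, or cells).
pct <- c(
  discrete_columns     = discrete / columns,
  continuous_columns   = continuous / columns,
  complete_rows        = complete_rows / rows,
  missing_observations = missing_obs / total_obs
)

print(round(100 * pct, 1))
```

Under these assumptions, roughly 54.1% of columns are discrete, 45.9% are continuous, 79.5% of rows are complete, and 3.8% of all cells are missing.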

Percentages

Data Structure

Missing Data Profile

Univariate Distribution

Histogram

Bar Chart (with frequency)

## 7 columns ignored with more than 50 categories.
## dest: 105 categories
## tailnum: 4044 categories
## flight: 3844 categories
## minute: 60 categories
## time_hour: 6936 categories
## model: 128 categories
## name: 102 categories
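
The message above indicates that discrete columns are skipped once they exceed a category limit (50 here). A rough sketch of that kind of filter in base R follows. The data frame `toy` and the names `max_categories`, `ignored`, and `kept` are hypothetical, and the report's own implementation may differ, for example in how it treats missing values.

```r
# Hypothetical toy data; not the dataset summarised in this report.
toy <- data.frame(
  group  = rep(c("a", "b", "c"), length.out = 120),
  minute = as.character(rep(0:59, times = 2)),
  stringsAsFactors = FALSE
)

max_categories <- 50  # threshold quoted in the message above

# Count distinct non-missing values per column.
n_categories <- vapply(
  toy,
  function(col) length(unique(na.omit(col))),
  integer(1)
)

# Columns above the threshold are skipped; the rest would be charted.
ignored <- n_categories[n_categories > max_categories]
kept    <- names(n_categories)[n_categories <= max_categories]

cat(sprintf("## %s: %d categories\n", names(ignored), ignored), sep = "")
```

In this toy case `minute` has 60 distinct values and is skipped, while `group` has 3 and is kept.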

QQ Plot

## Warning: Removed 289 rows containing non-finite values (stat_qq).
## Warning: Removed 289 rows containing non-finite values (stat_qq_line).

## Warning: Removed 358 rows containing non-finite values (stat_qq).
## Warning: Removed 358 rows containing non-finite values (stat_qq_line).
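
The warnings above report rows dropped because their values were non-finite. The sketch below shows, in base R, how such a count can be reproduced before plotting, assuming "non-finite" covers `NA`, `NaN`, and infinite values. The vector `x` is made up for illustration.

```r
# Hypothetical values standing in for one continuous column.
x <- c(12, -3, NA, 40, NaN, 7, Inf, 0)

# is.finite() is FALSE for NA, NaN, Inf and -Inf.
n_removed <- sum(!is.finite(x))
finite_x  <- x[is.finite(x)]

# Theoretical vs. sample quantiles, computed without drawing a plot.
qq <- qqnorm(finite_x, plot.it = FALSE)

cat("Removed", n_removed, "non-finite values;", length(qq$x), "points remain\n")
```

Checking the count per column this way makes it easier to tell which variable each warning refers to.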

QQ Plot (by arr_delay)

## Warning: Removed 475 rows containing non-finite values (stat_qq).
## Warning: Removed 475 rows containing non-finite values (stat_qq_line).

## Warning: Removed 231 rows containing non-finite values (stat_qq).
## Warning: Removed 231 rows containing non-finite values (stat_qq_line).

Correlation Analysis

## 9 features with more than 20 categories ignored!
## dest: 100 categories
## tailnum: 3246 categories
## day: 31 categories
## flight: 3773 categories
## minute: 60 categories
## time_hour: 6642 categories
## manufacturer: 25 categories
## model: 121 categories
## name: 100 categories
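
One common way to include discrete features in a correlation matrix is to expand each into indicator (dummy) columns first, which also explains why features with many categories are left out: every category adds a column. Whether this report uses exactly that encoding is an assumption. The sketch below uses base R only, with a hypothetical data frame `toy`.

```r
# Hypothetical toy data with one discrete and one continuous feature.
toy <- data.frame(
  group = factor(c("a", "b", "c", "b", "a", "c")),
  value = c(5, 20, -2, 35, 0, 11)
)

# One indicator column per level of `group` (no intercept term).
dummies <- model.matrix(~ group - 1, data = toy)

# Combine indicators with the continuous column and correlate everything.
encoded <- cbind(dummies, value = toy$value)
corr    <- cor(encoded)

print(round(corr, 2))
```

With a 20-category cap, a single discrete feature contributes at most 20 rows and columns to the matrix, which keeps the resulting heatmap readable.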

Principal Component Analysis

## 7 features with more than 50 categories ignored!
## dest: 100 categories
## tailnum: 3246 categories
## flight: 3773 categories
## minute: 60 categories
## time_hour: 6642 categories
## model: 121 categories
## name: 100 categories

Bivariate Distribution

Boxplot (by arr_delay)

## Warning: Removed 197777 rows containing non-finite values (stat_boxplot).

## Warning: Removed 22806 rows containing non-finite values (stat_boxplot).

Scatterplot (by arr_delay)

## Warning: Removed 243 rows containing missing values (geom_point).

## Warning: Removed 243 rows containing missing values (geom_point).

## Warning: Removed 243 rows containing missing values (geom_point).

## Warning: Removed 243 rows containing missing values (geom_point).